ANALYZING AUCTION DATA HAVING A VARYING NUMBER OF BIDDERS

BACKGROUND 

[0001] Auctions are becoming an integral part of a company's strategy for 
procurement and excess disposal. Auctions can produce substantial cost savings 
by lowering transaction and negotiation costs and reducing uncertainty. To 
remain competitive in this arena, it is desirable to make optimal auction decisions. 
These decisions may include a seller setting the optimal reserve price and a 
bidder estimating the characteristics of his competitors. Some or all auction 
decisions may be based on the joint distribution of values for various bidders. 
With this characterization of the auction market, structural variables can be 
estimated and bidder behavior can be forecast. Improved techniques for 
estimating the joint distribution of bidder values are desirable. 

BRIEF SUMMARY 

[0002] In at least some embodiments, a method and associated apparatus 
comprise organizing previously acquired auction data into a plurality of sub- 
samples, each sub-sample comprising bid data associated with auctions having a 
common number of bidders, applying an inverse bid function to at least two sub- 
samples, pooling results from applying the inverse bid function to form a first pool, 
applying a direct bid function on the first pool to generate sample bids, matching 
bids from at least one sub-sample to the sample bids, and pooling results from 
the matching with the first pool to form a second pool. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0003] For a detailed description of the embodiments of the invention, reference 
will now be made to the accompanying drawings in which:
[0004] Figure 1 shows an exemplary embodiment of a historical auction data
set; 

[0005] Figure 2 depicts an exemplary technique in which sub-samples of 
varying sizes can be used to compute a distribution of estimated bidder values; 
[0006] Figure 3 depicts an alternative technique to that of Figure 2; and 
[0007] Figure 4 shows a system configured to implement a plurality of the 
techniques described herein. 

NOTATION AND NOMENCLATURE 
[0008] Certain terms are used throughout the following description and claims to 
refer to particular system components. As one skilled in the art will appreciate, 
computer companies may refer to a component by different names. This 
document does not intend to distinguish between components that differ in name 
but not function. In the following discussion and in the claims, the terms 
"including" and "comprising" are used in an open-ended fashion, and thus should 
be interpreted to mean "including, but not limited to..." Also, the term "couple" or
"couples" is intended to mean either an indirect or direct electrical connection. 
Thus, if a first device couples to a second device, that connection may be through 
a direct electrical connection, or through an indirect electrical connection via other 
devices and connections. 

DETAILED DESCRIPTION 
[0009] The following discussion is directed to various embodiments of the 
invention. Although one or more of these embodiments may be preferred, the 
embodiments disclosed should not be interpreted, or otherwise used, as limiting
the scope of the disclosure, including the claims. In addition, one skilled in the art 
will understand that the following description has broad application, and the 
discussion of any embodiment is meant only to be exemplary of that embodiment, 
and not intended to intimate that the scope of the disclosure, including the claims, 
is limited to that embodiment. 

[0010] In accordance with the embodiments of the invention, an analytical 
process is provided herein that analyzes historical bid data (i.e., bid data from 
previous auctions) and processes such bid data to generate an estimated 
distribution of bid values for a given auction. In general, the process described
herein comprises a fully non-parametric, analytical approach that makes use of 
previously acquired auction data sets. Each data set may comprise bid data that 
pertains to one or more previously occurring auctions and auctions that may have 
a disparate number of bidders. 

[0011] The following explanation provides various equations and/or formulae 
that may be used in accordance with the techniques discussed herein. By way of 
definition, for a given set of structural variables, a direct bid function represents 
the optimal bid corresponding to all values of a specified value distribution, F(v1,
v2, ..., vn), where v represents a value for a particular bidder and n is the number of
bidders. For ease of explanation, the number of bidders (n) represents the sole 
structural variable associated with the auction. In addition, the bid function may 
be considered to be identical for all bidders and is given by: 

$$b_i = s(v_i, F, n) \tag{1}$$
where i is an index that specifies the bidder.

[0012] While the direct bid function expresses the bid submitted by a bidder as 
a function of the bidder's valuation, the distribution of bidder valuations and the
number of bidders in the auction, an inverse bid function, as represented below,
expresses a bidder's valuation as a function of the bid he or she submits, the
distribution of posted bids, G(b1, b2, ..., bn), and the number of bidders in the
auction: 

$$v_i = s^{-1}(b_i, G, n) \tag{2}$$
[0013] Both functions b and v above can be derived from the bidder's 
optimization problem. 

[0014] The probability of setting the winning bid is a function of the bid 
distribution. Under equilibrium, however, the probability of setting the winning bid 
is equal to the probability of drawing the highest value. Therefore, the probability 
can be rewritten in terms of the distribution of bidder values as well. The 
embodiments provided herein are provided for two cases. In the Independent 
Private Values ("IPV") case, all bidders have independent and identical marginal 
value distributions. In the Affiliated Private Values ("APV") case, the bidders' 
valuations are allowed to be statistically correlated.

[0015] As noted above, in the IPV case bidders have independent and identical
marginal value distributions. The direct bid function for the IPV case
may take the following form:

$$b_i = s(v_i, F, n) = v_i - \frac{1}{[F(v_i)]^{n-1}} \int_{\underline{v}}^{v_i} [F(u)]^{n-1}\,du \tag{3}$$

where $\underline{v}$ denotes the lowest possible value.

[0016] In the IPV case, the following formulation of the inverse bid function is
applicable:

$$v_i = b_i + \frac{1}{n-1}\,\frac{G(b_i, n)}{g(b_i, n)} \tag{4}$$

where G(b) is the bid distribution function and g(b) is the density function. 
[0017] The inverse-bid-function formulation expresses the unknown valuation of
a bidder in terms of observable variables. The bid distribution function, G(b,n),
and density function, g(b,n), can be estimated non-parametrically from a set of 
observed bids from a historical data set using equations (5) and (6) below: 



In equations (5) and (6), L represents the number of auctions in the data set and 
n represents the number of bidders. In equation (10), the value hg represents the 
"bandwidth" parameter that is also referred to as a "smoothing" or "regularization" 
parameter. The parameter hg controls the level of smoothness of the resulting 
density estimate. Further, the function K may be any suitable kernel function. 
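
Equations (5) and (6) are not legible in this copy. One standard non-parametric form that is consistent with the symbols defined above (L auctions, n bidders, bandwidth $h_g$, kernel K) is the empirical distribution function paired with a kernel density estimator. It is offered as an assumed reconstruction, not necessarily the original expressions; the indicator function $\mathbf{1}(\cdot)$ and the notation $b_{il}$ for the bid of bidder i in auction l are introduced here for illustration only.

```latex
\hat{G}(b, n) = \frac{1}{nL} \sum_{l=1}^{L} \sum_{i=1}^{n} \mathbf{1}(b_{il} \le b)
\qquad (5)

\hat{g}(b, n) = \frac{1}{nL\,h_g} \sum_{l=1}^{L} \sum_{i=1}^{n}
    K\!\left(\frac{b - b_{il}}{h_g}\right)
\qquad (6)
```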
[0018] Not all auction markets necessarily conform to the IPV assumption. In 
such cases, the APV model may provide more useful recommendations 
regarding the structure of the auction. In the APV model the joint value
distribution is not simplified because the marginal distributions are correlated. As 
a result, the conditional value distribution is the main determinant of the bidding 
decision. Under equilibrium, the probability of setting the winning bid is given by
$F_{y_i|v_i}(v_i|v_i)$, where $y_i = \max_{j \neq i} v_j$.
[0019] The APV direct bid function is expressed as:

$$b_i = v_i - \int_{\underline{v}}^{v_i} L(\alpha|v_i)\,d\alpha, \quad \text{with } L(\alpha|v_i) = \exp\left[-\int_{\alpha}^{v_i} \frac{f_{y_i|v_i}(u|u)}{F_{y_i|v_i}(u|u)}\,du\right] \tag{7}$$

[0020] The inverse bid function for the APV case may be obtained by setting
$G_{B_i|b_i}(b_i|b_i) = F_{y_i|v_i}(s^{-1}(b_i)|s^{-1}(b_i))$ and $b_i = s(v_i)$ to obtain:

$$v_i = b_i + \frac{G_{B_i|b_i}(b_i|b_i)}{g_{B_i|b_i}(b_i|b_i)} \tag{8}$$

[0021] For the inverse bid function, the expressions $G_{B_i|b_i}(b_i|b_i)$ and
$g_{B_i|b_i}(b_i|b_i)$ are estimated using the following joint analogs:

(9)

(10)

[0022] The formulas of (9) and (10) also may be used to calculate
$f_{y_i|v_i}(u|u)/F_{y_i|v_i}(u|u)$ in the direct bid function (7). To estimate the conditional
distribution and density, the joint formulas (9) and (10) provided above are divided
by the probability of observing the conditioning factor. That is:
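
The expression that follows "That is:" is not reproduced in this copy. Under the usual definition of a conditional distribution, and assuming that $G_{B_i,b_i}$ and $g_{B_i,b_i}$ denote the joint analogs of (9) and (10) and that $g_{b_i}$ denotes the marginal density of bidder i's own bid (all three symbols are notation assumed here for illustration), the division described above would read:

```latex
G_{B_i|b_i}(B|b) = \frac{G_{B_i,b_i}(B, b)}{g_{b_i}(b)},
\qquad
g_{B_i|b_i}(B|b) = \frac{g_{B_i,b_i}(B, b)}{g_{b_i}(b)}
```

Because the same marginal density divides both expressions, it cancels in the ratio used by equation (8).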

[0023] The techniques described herein process data that has been acquired 
regarding previously occurring auctions. Exemplary embodiments of the 
invention permit auction data sets that include auctions having a disparate 
number of bidders to be processed. Figure 1 illustrates an exemplary format of an
auction data set 50. In general, each row 52 in data set 50 corresponds to an 
auction participant's bid for a given auction. The auction is identified by an 
Auction identifier ("ID") 54 and the bidder is identified by a Bidder ID 56. The 
Auction ID and Bidder ID values 54, 56 can be any alphanumeric identifying value
that uniquely identifies the auction and bidder. The bid entered by the bidder is 
provided in column 58 labeled "Bid." One or more auction rules 60 may be 
provided that define the characteristics of the auction. Exemplary rules 60 may
include any of a variety of rules such as the number of bidders permitted in the
corresponding auction and the reserve price. 

[0024] Figure 2 illustrates an embodiment of the invention in which historical 
auction data is processed and used to generate an estimated distribution of 
values. An input data set may comprise bid data of multiple auctions with multiple 
bidders in each auction. In the embodiment of Figure 2, the data set is divided 
into groups of sub-samples (i.e., sub-sets). A first group 66 of sub-samples 
comprises "large" sub-samples 70 while a second group 68 of sub-samples 
comprises "small" sub-samples 72. Each group 66, 68 may comprise any 
number of sub-samples 70, 72. Each sub-sample 70, 72 comprises historical 
auction data of one or more auctions having the same number of bidders. That
is, the sub-samples 70 in the first group 66 and the sub-samples 72 in the
second group 68 differ from one another in that each sub-sample comprises auction
data having a different number of bidders than the other sub-samples. The large
sub-samples 70 may be differentiated from the small sub-samples 72 in that the 
large sub-samples 70 pertain to auctions having a sufficiently large number of 
bidders to make reliable use of the auction data. In general, a large sub-sample 
contains bid data associated with auctions that have more than a pre-specified 
number of bid observations. The small sub-samples 72 generally pertain to 
auctions having so few bidders that, without the techniques described herein, use 
of such data generally would be unreliable. In general, a small sub-sample
contains bid data associated with auctions that have fewer than the pre-specified
number of bid observations. A sub-sample associated with an auction having the
pre-specified number of bidders can be classified as large or small as desired. The
pre-specified number of bid observations is application specific and thus the 
differentiation between "large" and "small" sub-samples is application specific. 
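
The grouping described above can be sketched as follows. This is a minimal illustration rather than the claimed implementation: the row layout mirrors Figure 1, but the sample rows, the function names `build_subsamples` and `split_large_small`, and the `min_observations` threshold are all hypothetical, and a sub-sample exactly at the threshold is treated as large here because the text leaves that choice open.

```python
from collections import defaultdict

# Hypothetical rows in the layout of Figure 1: (auction_id, bidder_id, bid).
rows = [
    ("A1", "b1", 10.0), ("A1", "b2", 12.0), ("A1", "b3", 11.0),
    ("A2", "b1", 9.0), ("A2", "b4", 13.0), ("A2", "b5", 12.5),
    ("A3", "b2", 8.0), ("A3", "b6", 9.5),
]

def build_subsamples(rows):
    """Group bids by auction, then pool auctions that share a bidder count."""
    bids_by_auction = defaultdict(list)
    for auction_id, _bidder_id, bid in rows:
        bids_by_auction[auction_id].append(bid)
    subsamples = defaultdict(list)  # n -> all bids from auctions with n bidders
    for bids in bids_by_auction.values():
        subsamples[len(bids)].extend(bids)
    return dict(subsamples)

def split_large_small(subsamples, min_observations):
    """Classify each sub-sample by its number of bid observations."""
    large = {n: b for n, b in subsamples.items() if len(b) >= min_observations}
    small = {n: b for n, b in subsamples.items() if len(b) < min_observations}
    return large, small

subsamples = build_subsamples(rows)
large, small = split_large_small(subsamples, min_observations=4)
```

With these rows, the two three-bidder auctions form one sub-sample of six bids and the single two-bidder auction forms a sub-sample of two bids, so a threshold of four observations places the former in the large group and the latter in the small group.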
[0025] By application of equation (4) above, a "pseudo-value" 80 can be computed
in the IPV case for each bidder, denoted by the index i, in each of the large
sub-samples 70. The pseudo-value 80 is indicative of the value that the
bidder associates with the item being auctioned. The input values to equation (4)
include the number of bidders, n, in the sub-sample, the bid distribution function,
G(b), from equation (5), the density function, g(b), from equation (6) and the bid
price for the bidder. 
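
As a rough sketch of this step, the pseudo-values of equation (4) can be computed from one large sub-sample as shown below. The sketch assumes NumPy, a Gaussian kernel for K, an empirical distribution function for G(b), and a bandwidth supplied by the caller; none of these choices, nor the function names, come from the text.

```python
import numpy as np

def empirical_cdf(bids, b):
    """Empirical bid distribution G(b): fraction of observed bids <= b."""
    bids = np.asarray(bids, dtype=float)
    b = np.atleast_1d(np.asarray(b, dtype=float))
    return (bids[None, :] <= b[:, None]).mean(axis=1)

def kernel_density(bids, b, bandwidth):
    """Kernel estimate of the bid density g(b); Gaussian K is an assumption."""
    bids = np.asarray(bids, dtype=float)
    b = np.atleast_1d(np.asarray(b, dtype=float))
    u = (b[:, None] - bids[None, :]) / bandwidth
    k = np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)
    return k.mean(axis=1) / bandwidth

def ipv_pseudo_values(bids, n, bandwidth):
    """Equation (4): v_i = b_i + G(b_i) / ((n - 1) * g(b_i))."""
    bids = np.asarray(bids, dtype=float)
    G = empirical_cdf(bids, bids)
    g = kernel_density(bids, bids, bandwidth)
    return bids + G / ((n - 1) * g)

# Synthetic check: with values uniform on [0, 1] and n bidders, the
# equilibrium bid is v * (n - 1) / n, so pseudo-values should track values.
rng = np.random.default_rng(0)
n = 3
values = rng.uniform(0.0, 1.0, size=300)
bids = values * (n - 1) / n
pseudo = ipv_pseudo_values(bids, n, bandwidth=0.05)
```

Kernel density estimates are biased near the edges of the bid range, so pseudo-values for the highest bids are typically less reliable than those in the interior.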

[0026] Once the bid distribution and density functions from equations (5) and (6) 
are evaluated, the resulting G(b) and g(b) values are included in equation (4) to 
compute a pseudo-value 80 for the ith bidder. In the IPV case, the resulting 
pseudo-values 80 can be aggregated, pooled, or otherwise combined together at 
82. Although the data from the small sub-samples 72 generally is not used 
directly in non-parametric estimation, estimates for the small sub-samples 72 are 
computed by using the pseudo-values 80 from the large sub-samples 70. Using 
the IPV direct bid function from equation (3) and the large sample estimates, 
sample bids 88 are computed for a given small sub-sample 72. Pseudo-values 
86 for the sample then are computed by matching or comparing the observed 
bids 72 to the sample bids 88. The new pseudo-values 86 are also added to the 
pool of values 82 to form a pool of values 90 from which a value distribution can
be computed. 
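
The sample-bid and matching steps for a small sub-sample can be sketched as follows. This is one possible reading, not the definitive procedure: it assumes NumPy, estimates F by the empirical distribution of the pooled pseudo-values, evaluates equation (3) on an evenly spaced grid with trapezoidal integration, and interprets "matching" as monotone interpolation from sample bids back to values. The function names and the `grid_size` parameter are hypothetical.

```python
import numpy as np

def ipv_sample_bids(pool, n, grid_size=200):
    """Evaluate the direct bid function (3) on a grid of values."""
    pool = np.sort(np.asarray(pool, dtype=float))
    grid = np.linspace(pool[0], pool[-1], grid_size)
    # Empirical F on the grid; F > 0 everywhere because grid[0] == pool[0].
    F = np.searchsorted(pool, grid, side="right") / pool.size
    Fn = F ** (n - 1)
    # Cumulative trapezoidal integral of F^(n-1) from grid[0] to each point.
    steps = 0.5 * (Fn[1:] + Fn[:-1]) * np.diff(grid)
    integral = np.concatenate(([0.0], np.cumsum(steps)))
    return grid, grid - integral / Fn

def match_bids(observed_bids, grid, sample_bids):
    """Map observed bids to values by interpolating the sample-bid curve."""
    mono = np.maximum.accumulate(sample_bids)      # guard against rounding
    uniq, first = np.unique(mono, return_index=True)  # strictly increasing
    # np.interp clamps bids outside the sampled range to the end values.
    return np.interp(observed_bids, uniq, grid[first])

# Synthetic check: a pool that is uniform on [0, 1] and n = 2 bidders
# imply bids of roughly v / 2, so matching should roughly double each bid.
pool = np.linspace(0.0, 1.0, 1001)
grid, sample_bids = ipv_sample_bids(pool, n=2)
new_pseudo_values = match_bids(np.array([0.1, 0.25, 0.4]), grid, sample_bids)
```

The values returned by `match_bids` play the role of the new pseudo-values 86, which would then be appended to the first pool to form the second pool.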

[0027] Referring still to Figure 2, in the APV case the large sample pseudo- 
values are not pooled together. Instead, the pseudo-values 80 from each large 
sub-sample 70 may be re-sampled separately to create all combinations of value
data conforming to the number of bidders in a given small sub-sample. The re- 
sampled data is used to calculate sample bids for the small samples 72. 
Matching extracts value estimates. Use of the APV direct bid function of equation 
(7) permits the small sub-samples to be incorporated into the estimation, and 
captures a portion of the shared information between large sub-samples 70 and
small sub-samples 72. 

[0028] Figure 3 illustrates a technique in which the historical data set is 
apportioned into one or more sub-samples 100a-100c. Each sub-sample 
includes bid data pertaining to auctions having the same number of bidders. 
Sub-sample 100a includes bid data of the auctions having the largest number of 
bidders. Sub-sample 100b includes bid data of auctions with the next largest 
number of bidders. Additional sub-samples may be included as well pertaining to 
auctions with increasingly smaller numbers of bidders. Sub-sample 100c 
includes bid data of auctions having the fewest number of bidders. Beginning
with the largest sub-sample (100a), the technique of Figure 3 applies the APV
inverse bid function of (8) to sub-sample 100a to produce initial pseudo-values
102 which are added to the pool 108. The technique then uses the initial
pseudo-values 102 in the APV direct bid function of (7) to calculate a set of sample
bids 104b that correspond to the next largest sub-sample (100b). The
pseudo-values 102b associated with sub-sample 100b are obtained by matching the
observed bids to the sample bids and are added to pool 108. The newly 
expanded pool 108 then is used to calculate pseudo-values for the next largest 
sub-sample and the process continues through the smallest sub-sample 100c. 
[0029] At the end of the estimation, the pool of values 108 contains data from 
each sub-sample and captures correlation(s) between the sub-samples. Sharing 
information among sub-samples may improve the estimation. If there is a gain to
be achieved from information sharing, the process may iterate back to the largest 
sub-sample 100a and may repeat the process one or more times until the 
resulting value estimates fail to improve. 
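
The control flow of a single pass through Figure 3 can be sketched as below. Only the ordering and pooling logic is shown; the three callables are placeholders that a real implementation would replace with the APV functions of equations (8) and (7) and a suitable matching rule. The stubs in the usage example use the closed-form uniform IPV case purely so the sketch runs, and every name here is hypothetical.

```python
def sequential_pool(subsamples, inverse_bid, direct_bid, match):
    """One pass of Figure 3: largest bidder count first, pool grows each step.

    subsamples: dict mapping number of bidders n -> list of observed bids.
    inverse_bid(bids, n) -> pseudo-values for the largest sub-sample.
    direct_bid(pool, n)  -> one sample bid per pooled value, for n bidders.
    match(bids, sample_bids, pool) -> pseudo-values for the observed bids.
    """
    order = sorted(subsamples, reverse=True)
    largest = order[0]
    pool = list(inverse_bid(subsamples[largest], largest))
    for n in order[1:]:
        sample_bids = direct_bid(pool, n)
        pool.extend(match(subsamples[n], sample_bids, pool))
    return pool

# Placeholder stubs (uniform-value IPV closed forms, NOT the APV functions).
def stub_inverse(bids, n):
    return [b * n / (n - 1) for b in bids]

def stub_direct(pool, n):
    return [v * (n - 1) / n for v in pool]

def stub_match(bids, sample_bids, pool):
    # Nearest-neighbour matching: take the value whose sample bid is closest.
    def nearest(b):
        k = min(range(len(pool)), key=lambda j: abs(sample_bids[j] - b))
        return pool[k]
    return [nearest(b) for b in bids]

subsamples = {3: [0.2, 0.4, 0.6], 2: [0.15, 0.45]}
pool = sequential_pool(subsamples, stub_inverse, stub_direct, stub_match)
```

Repeating the pass until the value estimates stop improving, as described above, would wrap this function in an outer loop with a convergence check that is not shown here.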

[0030] Looping back through the data set may be difficult in the APV case 
because sampling may rely on the pooled data being of higher dimension than 
the sub-sample in question. Several refinements, however, can be made to 
address this issue. In one embodiment, parametric representation of correlation 
may be used and the relevant parameters can be estimated using the largest 
subsample. 

[0031] Figure 4 illustrates an exemplary embodiment of a system 200 that 
implements the techniques described herein. As shown, the system 200 may 
include a processor 202 coupled to a memory 204, an input device 206, an output
device 208 and a network interface 210. The memory 204 may comprise volatile 
memory and/or non-volatile memory and contain an auction application 220 that 
is executed by processor 202. Auction application 220 performs some or all of 
the actions described above. The input device 206 may comprise a keyboard, 
mouse, or other suitable type of input device. The output device 208 may 
comprise a display. The network interface 210 provides the system 200 with 
connectivity to a network. The historical auction data sets processed by the
auction application 220 may be stored in memory 204 or on the network and
provided to the system 200 via the network interface 210. 

[0032] The technique disclosed herein by which the direct and inverse bid 
functions iteratively use all available data is applicable in other situations where 
the subsamples may be formed on the basis of variables other than the number 
of bidders in the auction. Variation in factors (such as the reserve price) creates
similar problems for data analysis. If the available data contains auctions with
different reserve prices, then one can modify the disclosed technique by replacing
the number of bidders with a reserve price as the basis of forming subsamples.
[0033] The above discussion is meant to be illustrative of the principles and 
various embodiments of the present invention. Numerous variations and 
modifications will become apparent to those skilled in the art once the above 
disclosure is fully appreciated. The following claims embrace all such variations 
and modifications.